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Abstract: This research introduces a novel technique for optimizing remote Photoplethysmography 
(rPPG) networks through pruning, aiming to create compact yet robust models that operate effectively 
even with minimal training data. The method emphasizes reducing network complexity without 
compromising accuracy, specifically tailored for real-time use cases. Experimental findings illustrate 
that pruned networks achieve comparable performance to their larger counterparts, underscoring 
their viability for deployment in environments with limited resources. 


1. Introduction 


Remote Photoplethysmography (rPPG) represents a cutting-edge technology enabling 
contactless measurement of physiological signals like heart rate by analyzing subtle changes 
in skin color through a video camera. This non-invasive method offers distinct advantages 
over traditional methods, particularly in contexts requiring continuous monitoring without 
discomfort [1-26]. Its applications span healthcare, fitness, and human-computer interaction 
[27-39]. However, deploying rPPG faces challenges due to the computational demands and 
data volume needed for accuracy. Conventional rPPG networks are large and resource- 
intensive, limiting their use in real-time and on low-power devices[19, 21, 28]. 

To address these challenges, pruning—a proven deep learning technique—offers a 
solution by reducing network size while preserving performance. This approach creates 
smaller, faster models suitable for resource-constrained environments. The key lies in 
maintaining accuracy despite limited training data [40, 51, 58, 59]. 

This study proposes a novel pruning strategy tailored for rPPG networks. It begins 
with pre-training to establish a baseline, followed by strategic removal of less crucial 
parameters based on factors like weight magnitude and parameter impact. Fine-tuning 
then optimizes the pruned network for performance recovery and efficiency enhancement. 
Extensive experiments on standard rPPG datasets validate the efficacy of the approach, 
showing that pruned networks achieve performance akin to larger models while signifi- 
cantly improving computational efficiency. Moreover, these models demonstrate robust 
generalization to unseen data [21, 40, 51]. 

In essence, this research addresses a critical gap in rPPG technology by developing 
compact, efficient networks through advanced pruning techniques. By enabling high 
performance with limited data, it expands the practical application of rPPG technology 
across diverse and constrained environments, contributing to the advancement of neural 
network optimization for real-world applications. 


2. Method 


Pre-training the rPPG Network: Initially, our approach focuses on pre-training the rPPG 
network with available data, crucial for establishing a performance baseline. This phase 
employs a conventional rPPG network architecture, comprising convolutional and fully 
connected layers. The network is trained to detect subtle skin color changes in video frames, 
correlating with blood volume fluctuations and thereby heart rate. Standard techniques 
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like data augmentation and regularization bolster network robustness, crucial given the s 
dataset’s constraints. 40 

Pruning Strategy: Post pre-training and baseline establishment, the pruning phase 1 
commences, systematically identifying and removing less impactful network parameters. 42 
Key criteria include weight magnitude and parameter contribution to network output. a3 
Parameters with minimal impact on activations are targeted for removal, while sensitivity 44 
analysis ensures pruning doesn’t compromise performance. This strategy safeguards as 
critical parameters, vital for accurate heart rate measurement. 46 

Layer-wise Pruning and Fine-tuning: Pruning proceeds layer by layer, starting from a7 
input layers and advancing to outputs, simplifying complexity while monitoring per- 4s 
formance. Each pruning step is followed by fine-tuning, retraining retained parameters 49 
to mitigate any performance decline. This process optimizes the network for reduced so 
parameters, maintaining or enhancing baseline performance. 51 

Pruning Iterations: Iterative pruning involves multiple cycles of parameter removal s2 
and fine-tuning. Each iteration removes a set percentage of least significant parameters, s3 
followed by fine-tuning to preserve performance integrity. This iterative method ensures sa 
gradual model simplification while minimizing performance impact, achieving a balanced ss 
trade-off between compactness and accuracy. 56 


Table 3. PhysNet architecture specification. The 2D kernel is of 
size H x W, and the 3D kernel is of size T x H x W, where 
C,T, H, W denote channel, time, height, and width, respectively. 
The dimension of the output size is C x T x H x W. 


Name Kernel Output 
Input none 3 x T x 192 x 128 
conv2D 5x5 32 x T x 192 x 128 


maxpooling; | 1x 2x2 32 x T x 96 x 64 
conv3D,, 3x3x3 64 x T x 96 x 64 
conv3D}. 3x3x3 64 x T x 96 x 64 


maxpoolingg | 1x 2x2 64 x T x 48 x 32 
conv3D21 3x3x3 64 x T x 48 x 32 
conv3D22 3x3x3 64 x T x 48 x 32 


maxpooling; | 1x 2x2 64 x T x 24 x 16 


conv3D31 3x3x3 64 x T x 24 x 16 
conv3D3 3x3x3 64 x T x 24 x 16 
maxpooling, | 1 x 2x2 64xTx12x8 
conv3D4, 3x3x3 64xTx12~x8 
conv3D42 3x3x3 64xTx12~x8 


avgpooling 1x12x8 64xTx1x1 


conv 1x1xil 1xTx1xil 


Evaluation and Performance Metrics: The pruned network’s effectiveness is assessed 57 
using standard metrics: heart rate estimation accuracy, mean absolute error (MAE), and ss 
computational efficiency. Evaluation occurs across training and validation datasets to sə 
ensure generalizability. Inference time and memory usage quantify computational benefits. 6o 
Comparative analysis with the original network underscores trade-offs and gains from 61 
pruning. Results demonstrate the pruned network meets or exceeds benchmarks of the 62 
larger model, emphasizing speed and resource efficiency. This methodology ensures com- 63 
pact, high-performance rPPG models. Pre-training, systematic pruning, fine-tuning, and 64 
iterative refinement optimize networks for real-time and resource-constrained scenarios. es 
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Table 1. Comparison of common networks proposed for rPPG pulse extraction. 


Name | Input size #of layers #of parameters(x10°) storage (MB) FLOPs (x10°) 
DeepPhys [3] 3 x 150 x 36 x 36 9 1.46 5.70 9.62 
HR-CNN [36] 3 x 300 x 192 x 168 13 1.87 7.32 988.97 
PhysNet [49] 3 x 128 x 128 x 128 15 0.83 3.26 130.52 
MTTS-CAN [27] 3 x 150 x 36 x 36 9 1.45 5.70 9.61 
DeeprPPG [26] 3 x 120 x 128 x 64 15 0.54 2.12 26.76 
RhythmNet [52] 3 x 10 x 300 x 25 21 11.42 44.64 1.70 


Table 2. Comparison of selected publicly available datasets used for rPPG research. 


Name | # of subjects #offrames # of videos resolution average duration/video (sec) storage (GB) 
PURE [5°] 10 125,366 60 640 x 480 69.9 38.6 
COHFACE [13] 40 202,092 164 640 x 480 61.6 0.662 
ECG-fitness [36] 17 407,232 202 1920 x 1080 67.2 1044 
UBFC-1PPG [1] 42 81,401 42 640 x 480 64.4 69.8 
BUAA-MIRR [46] 13 257,339 143 640 x 480 59.9 220 
NBHR [16] 257 886,001 1130 640 x 480 32.7 921 


3. Experimental Results 
3.1. Comparative Analysis 


To thoroughly assess the pruned networks’ ability to generalize, they underwent 
rigorous testing on unseen validation data sets. Across a spectrum of scenarios, including 
varied skin tones, lighting conditions, and motions, the pruned networks consistently 
demonstrated robust performance. They maintained high accuracy levels and exhibited low 
mean absolute error (MAE), showcasing their resilience in adapting to new and challenging 
data environments. This validation process underscored that the pruning methodology 
effectively preserved the network’s capacity to handle real-world complexities without 
compromising performance. 

Comparative analyses between the pruned and original, unpruned networks provided 
deeper insights into the benefits and trade-offs of the pruning process. While the pruned 
networks exhibited a slight increase in MAE compared to their unpruned counterparts, 
this marginal trade-off was vastly outweighed by the substantial gains in computational 
efficiency. Notably, the pruned networks showed marked reductions in inference time and 
memory usage, making them exceptionally suitable for deployment in resource-constrained 
environments, such as mobile devices and wearables. 

Overall, the analysis confirmed that the pruning strategy successfully balances model 
compactness, accuracy, and computational efficiency. It validated the efficacy of pruned 
rPPG networks in optimizing for real-time applications, particularly in contexts where 
computational resources are limited. These findings solidify the potential of pruned rPPG 
networks for advancing non-contact health monitoring systems in practical, everyday 


settings. 
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